A tabular approach to the sequence-to-structure relation in proteins (tetrapeptide representation) for de novo protein design.
نویسندگان
چکیده
BACKGROUND Experimental observations classify the protein-folding process as a multi-step event. The backbone conformation has been experimentally recognized as responsible for the early-stage structural forms of a polypeptide. The sequence-to-structure and structure-to-sequence relation is critical for predicting protein structure. A contingency table representing this relation for tetrapeptides in their early-stage is presented. Their correlation seems to be essential in protein-folding simulation. MATERIAL/METHODS The polypeptide chains of all the proteins in the Protein Data Bank were transformed into their early-stage structural forms. The tetrapeptide was selected as the structural unit. Tetrapetide sequences and structures were expressed by letter codes. The transformation of a contingency table of any size (here: 160,000x2401) to a 2x2 table performed for each non-zero cell of the original table allowed calculation of the rho-coefficient measuring the strength of the relation. RESULTS High values of the rho-coefficient extracted sequences of strong structural determinability and structures of high sequence selectivity. The web-site program to calculate the rho-coefficient ranking list was constructed to enable applying this method to any problem of contingency table analysis. CONCLUSIONS The results revealed sequence-to-structure (and vice versa) correlation in early-stage folding. Surprisingly, the irregular structural forms of loops and bends appeared to be highly determined. Comparison of these results with another method based on information entropy revealed high accordance. The method oriented on interpretation of a large contingency table seems very useful especially for large-scale microarray analysis, a very popular technique in the post-genomic era.
منابع مشابه
Designing a new tetrapeptide to inhibit the BIR3 domain of the XIAP protein via molecular dynamics simulations
The XIAP protein is a member of apoptosis proteins family. The XIAP protein plays a central role in the inhibition of apoptosis and consists of three Baculoviral IAP Repeat domains. The BIR3 domain binds directly to the N-terminal of caspase-9 and therefore it inhibits apoptosis. N-terminal tetrapeptide region of SMAC protein can bind to BIR3, inhibit it and subsequently induce apoptosis. In th...
متن کاملOn solving possibilistic multi- objective De Novo linear programming
Multi-objective De Novo linear programming (MODNLP) is problem for designing optimal system by reshaping the feasible set (Fiala [3] ). This paper deals with MODNLP having possibilistic objective functions coefficients. The problem is considered by inserting possibilistic data in the objective functions coefficients. The solution of the problem is defined and established under the using of effi...
متن کاملDesign and Production of Recombinant TAT Protein Structure, Catalytic Domain of Diphtheria Toxin, and Evaluation of Its Effect on Cell Line
Background and Objectives: Cancer is one of the most deadly diseases in the present age and its conventional therapies have had low success. Toxin therapy of cancer is a new therapeutic approach, which has attracted the attention of pharmaceutical specialists. Diphtheria toxin consists of three functional, transducing, and binding domains, that the functional part inhibits protein synthesis and...
متن کاملConsidering Uncertainty in Modeling Historical Knowledge
Simplifying and structuring qualitatively complex knowledge, quantifying it in a certain way to make it reusable and easily accessible are all aspects that are not new to historians. Computer science is currently approaching a solution to some of these problems, or at least making it easier to work with historical data. In this paper, we propose a historical knowledge representation model takin...
متن کاملAb initio prediction of the three-dimensional structure of a de novo designed protein: a double-blind case study.
Ab initio structure prediction and de novo protein design are two problems at the forefront of research in the fields of structural biology and chemistry. The goal of ab initio structure prediction of proteins is to correctly characterize the 3D structure of a protein using only the amino acid sequence as input. De novo protein design involves the production of novel protein sequences that adop...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Medical science monitor : international medical journal of experimental and clinical research
دوره 12 6 شماره
صفحات -
تاریخ انتشار 2006